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Abstract 

We propose Range and Roots which are two common patterns useful for spec- 
ifying a wide range of counting and occurrence constraints. We design specialised 
propagation algorithms for these two patterns. Counting and occurrence constraints 
specified using these patterns thus directly inherit a propagation algorithm. To illus- 
trate the capabilities of the Range and ROOTS constraints, we specify a number of 
global constraints taken from the literature. Preliminary experiments demonstrate 
that propagating counting and occurrence constraints using these two patterns leads 
to a small loss in performance when compared to specialised global constraints and 
is competitive with alternative decompositions using elementary constraints. 



"This paper is a compilation and an extension of 1101 . 1111 . and 1121 . The first author was supported 
by the ANR project ANR-06-BLAN-0383-02. 
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1 Introduction 



Global constraints are central to the success of constraint programming [28]. Global 
constraints allow users to specify patterns that occur in many problems, and to exploit 
efficient and effective propagation algorithms for pruning the search space. Two com- 
mon types of global constraints are counting and occurrence constraints. Occurrence 
constraints place restrictions on the occurrences of particular values. For instance, we 
may wish to ensure that no value used by one set of variables occurs in a second set. 
Counting constraints, on the other hand, restrict the number of values or variables meet- 
ing some condition. For example, we may want to limit the number of distinct values 
assigned to a set of variables. Many different counting and occurrences constraints have 
been proposed to help model a wide range of problems, especially those involving re- 
sources (see, for example, [231 HI 121)1 151 [5]). 

In this paper, we will show that many such constraints can be specified by means of 
two new global constraints, Range and ROOTS together with some standard elementary 
constraints like subset and set cardinality. These two new global constraints capture 
the familiar notions of image and domain of a function. Understanding such notions 
does not require a strong background in constraint programming. A basic mathematical 
background is sufficient to understand these constraints and use them to specify other 
global constraints. We will show, for example, that Range and ROOTS are versatile 
enough to allow specification of open global constraints, a recent kind of global constraints 
for which the set of variables involved is not known in advance. 

Specifications made with Range and Roots constraints are executable. We show 
that efficient propagators can be designed for the Range and Roots constraints. We give 
an efficient algorithm for propagating the Range constraint based on a flow algorithm. 
We also prove that it is intractable to propagate the Roots constraint completely. We 
therefore propose a decomposition of the Roots constraint that can propagate it partially 
in linear time. This decomposition does not destroy the global nature of the Roots 
constraint as in many situations met in practice, it prunes all possible values. The 
proposed propagators can easily be incorporated into a constraint toolkit. 

We show that specifying a global constraint using Range and Roots provides us with 
an reasonable method to propagate counting and occurrence constraints. There are three 
possible situations. In the first, the global nature of the Range and Roots constraints 
is enough to capture the global nature of the given counting or occurrence constraint, 
and propagation is not hindered. In the second situation, completely propagating the 
counting or occurrence constraint is NP-hard. We must accept some loss of propaga- 
tion if propagation is to be tractable. Using Range and ROOTS is then one means to 
propagate the counting or occurrence constraint partially. In the third situation, the 
global constraint can be propagated completely in polynomial time but using Roots 
and Range hinders propagation. In this case, if we want to achieve full propagation, we 
need to develop a specialised propagation algorithm. 

We also show that decomposing occurrence constraints and counting constraints using 
the Range and Roots constraints performs well in practice. Our experiments on random 
CSPs and a on real world problem from CSPLib demonstrate that propagating counting 
and occurrence constraints using the Range and Roots constraints leads to a small loss 
in performance when compared to specialised global constraints and is competitive with 
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alternative decompositions into more elementary constraints. 

The rest of the paper is organised as follows. Section [2] gives the formal background. 
Section [3] defines the Range and ROOTS constraints and gives a couple of examples 
to illustrate how global constraints can be decomposed using these two constraints. In 
Section 21 we propose a polynomial algorithm for the Range constraint. In Section [5l we 
give a complete theoretical analysis of the ROOTS constraint and our decomposition of it, 
and we discuss implementation details. Section [6] gives many examples of counting and 
occurrence constraints that can be specified using the Range and ROOTS constraints. 
Experimental results arc presented in Section [7] Finally, we end with conclusions in 
Section [S] 

2 Formal background 

A constraint satisfaction problem consists of a set of variables, each with a finite domain 
of values, and a set of constraints specifying allowed combinations of values for subsets of 
variables. We use capitals for variables (e.g. X, Y and S), and lower case for values (e.g. 
v and w). We write D(X) for the domain of a variable X. A solution is an assignment of 
values to the variables satisfying the constraints. A variable is ground when it is assigned 
a value. We consider both integer and set variables. A set variable S is often represented 
by its lower bound lb(S) which contains the definite elements (that must belong to the 
set) and an upper bound ub(S) which also contains the potential elements (that may or 
may not belong to the set). 

Constraint solvers typically explore partial assignments enforcing a local consistency 
property using either specialised or general purpose propagation algorithms. Given a 
constraint C, a bound support on C is a tuple that assigns to each integer variable a 
value between its minimum and maximum, and to each set variable a set between its 
lower and upper bounds which satisfies C. A bound support in which each integer 
variable is assigned a value in its domain is called a hybrid support. If C involves only 
integer variables, a hybrid support is a support. A value (resp. set of values) for an 
integer variable (resp. set variable) is bound or hybrid consistent with C iff there exists 
a bound or hybrid support assigning this value (resp. set of values) to this variable. A 
constraint C is bound consistent (BC) iff for each integer variable Xi, its minimum and 
maximum values belong to a bound support, and for each set variable Sj, the values in 
ub(Sj) belong to Sj in at least one bound support and the values in lb(Sj) are those from 
ub(Sj) that belong to Sj in all bound supports. A constraint C is hybrid consistent (HC) 
iff for each integer variable Xi, every value in D(Xi) belongs to a hybrid support, and 
for each set variable Sj, the values in ub(Sj) belong to Sj in at least one hybrid support, 
and the values in lb(Sj) are those from ub(Sj) that belong to Sj in all hybrid supports. 
A constraint C involving only integer variables is generalised arc consistent (GAC) iff 
for each variable Xi, every value in D(Xi) belongs to a support. If all variables in C 
are integer variables, hybrid consistency reduces to generalised arc consistency, and if all 
variables in C are set variables, hybrid consistency reduces to bound consistency. 

To illustrate these different concepts, consider the constraint C(X\,X 2 , T) that holds 
iff the set variable T is assigned exactly the values used by the integer variables X\ and 
X 2 . Let D{X 1 ) = {1,3}, D(X 2 ) = {2,4}, lb(T) = {2} and ub(T) = {1,2,3,4}. BC 
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does not remove any value since all domains are already bound consistent (value 2 was 
considered as possible for X\ because BC deals with bounds). On the other hand, HC 
removes 4 from D{X2) and from ub{T) as there does not exist any tuple satisfying C in 
which X2 does not take value 2. 

We will compare local consistency properties applied to (sets of) logically equivalent 
constraints, ci and c-i. As in [T7], a local consistency property $ on c\ is as strong as 
on C2 iff, given any domains, if $ holds on c\ then "J holds on C2; $ on c\ is stronger 
than W on C2 iff $ on Ci is as strong as VP on C2 but not vice versa; $ on ci is equivalent 
to ^ on C2 iff $ on ci is as strong as ^ on C2 and vice versa; $ on c\ is incomparable to 
\& on C2 iff <£■ on c\ is not as strong as \& on C2 and vice versa. 

A total function T from a source set S into a target set T is denoted by : S — ► T. 
The set of all elements in S that have the same image j € T is JF _1 (j) = {i : ^"(i) = j}. 
The image of a set 5 C S under T is .F(iS) = Uies^X*)' wnus t the domain of a set 
T C T under T is J r_1 (T) = UigT*^* Throughout, we will view a set of integer 
variables, A"i to X n as a function A" : {l,..,n} — > Ui=i ^(Ai). That is, Af(i) is the 
value of Aj. 

3 Two useful patterns: RANGE and ROOTS 

Many counting and occurrence constraints can be specified using simple non-global con- 
straints over integer variables (like A < to), simple non-global constraints over set vari- 
ables (like Si C S2 or \S\ = k) available in most constraint solvers, and two special global 
constraints acting on sequences of variables: Range and ROOTS. Range captures the 
notion of image of a function and Roots captures the notion of domain. Specifica- 
tion with Range and ROOTS is executable. It permits us to decompose other global 
constraints into more primitive constraints. 

Given a function X representing a set of integer variables, A x to A n , the Range 
constraint holds iff a set variable T is the image of another set variable S under X. 

Range([Ai,..,A„],S',T) & T = X(S) (that is,T= {X t \ i € S}) 

The Roots constraint holds iff a set variable S is the domain of the another set variable 
T under X. 

Roots([Ai,...,A„],S',T) iff S = X- 1 {T) (that is, S = {i \ X, e T}) 

Range and Roots are not exact inverses. A Range constraint can hold, but the corre- 
sponding Roots constraint may not, and vice versa. For instance, Range([1, 1], {1}, {1}) 
holds but not Roots([1, 1], {1}, {1}) since = {1, 2}, and Roots([1, 1, 1], {1, 2, 3}, {I, 2}) 

holds but not Range([1, 1, 1], {1, 2, 3}, {1, 2}) as no Aj is assigned to 2. 

Before showing how to propagate Range and ROOTS efficiently, we give two examples 
that illustrate how some counting and occurrence global constraints from |J| can be 
specified using Range and Roots. 

The NValue constraint counts the number of distinct values used by a sequence of 
variables [2H El [7] . NValueQAi, .., X n ],N) holds iff N = \{X t | 1 < i < n}\. A way to 
implement this constraint is with a Range constraint: 
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NVALUE([X 1 ,..,X„],iV) iff 
RANGE([Xi,..,X n ],{l,..,n},T) A |T|=JV 

The AtMost constraint is one of the oldest global constraints [32]. The AtMost 
constraint puts an upper bound on the number of variables using a particular value. 
AtMostQXl, ..,X n ],d,N) holds iff \{i \ X t = d}\ < N. It can be decomposed using a 
Roots constraint. 

ATM.OST([Xi,..,X n ],d,N) iff 
BooTa([Xi,..,X n ],S,{d}) A \S\<N 

These two examples show that it can be quite simple to decompose global constraints 
using Range and Roots. As we will show later, some other global constraints will 
require the use of both Range and ROOTS in the same decomposition. The next sections 
show how Range and Roots can be propagated efficiently. 

4 Propagating the RANGE constraint 

Enforcing hybrid consistency on the Range constraint is polynomial. This can be done 
using a maximum network flow problem. In fact, the Range constraint can be decom- 
posed using a global cardinality constraint (Gcc) for which propagators based on flow 
problems already exist [551 (Ml- But the Range constraint does not need the whole 
power of maximum network flow problems, and thus HC can be enforced on it at a lower 
cost than that of calling a Gcc propagator. In this section, we propose an efficient way 
to enforce HC on Range. To simplify the presentation, the use of the flow is limited to 
a constraint that performs only part of the work needed for enforcing HC on Range. 
This constraint, that we name OCCURS ([Xl, . . . ,X n ],T), ensures that all the values in 
the set variable T are used by the integer variables X\ to X n : 

Occurs([Xi,...,X„],T) iff T C X({l..n}) (that is, T C {Xi \ i e l..n}) 

We first present an algorithm for achieving HC on OCCURS (Section 14. ip . and then 
use this to propagate the Range constraint fSection l4.2p . 

4.1 Hybrid consistency on OCCURS 

We achieve HC on Occurs([Xi, . . . , X n ],T) using a network flow. 

4.1.1 Building the network flow 

We use a unit capacity network [1] in which capacities between two nodes can only be or 
1. This is represented by a directed graph where an arc from node x to node y means that 
a maximum flow of 1 is allowed between x and y while the absence of an arc means that 
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Figure 1: Unit capacity network of the constraint C — Occurs([Xi, X 2 , -X3], T) with 
D{X X ) = {1,2}, D(X 2 ) = {2,3,4}, D(X 3 ) = {3,4}, lb(T) = {3,4} and ub(T) = 
{1,2,3,4}. Arcs are directed from left to right. 



the maximum flow allowed is 0. The unit capacity network Gc = (A, E) of the constraint 
C = OccURS([Xi, . . . , X n ],T) is built in the following way. N = {s} U Nt U N 2 U {t}, 
where s is a source node, t is a sink node, Ni — {v \ v £ \J D(Xi)} and N 2 — {z v \ v £ 
U £>(Xj)} U {xj I i 6 [I..71]}. The set of arcs E is as follows: 

E = ({s} x N{) U {(v, z v ),Vv i lb(T)} U {{v, x t ) \ v £ £>pQ)} U (A 2 x {<}) 

G c is quadripartite, i.e., £ C ({s} x iVi) U (N t x Ar 2 ) U (JV 2 x {t}). In Fig. [U we depict 
the network G c of the constraint C = OccursQXi, X 2 ,X 3 ],T) with D{X X ) = {1,2}, 
£>(X 2 ) = {2, 3, 4}, D(X 3 ) = {3, 4}, lb(T) = {3, 4} and ub(T) = {1, 2, 3, 4}. The intuition 
behind this graph is that when a flow uses an arc from a node v to a node Xi this means 
that Xi is assigned u, and when a flow uses the arc (v,z v ) this means that v is not 
necessarily used by the Xi'sQ In Fig. Q] nodes 3 and 4 are linked only to nodes x 2 and 
X3, that is, values 3 and 4 must necessarily be taken by one of the variables Xi (3 and 
4 belong to lb(T)). On the contrary, nodes 1 and 2 are also linked to nodes z\ and z 2 
because values 1 and 2 do not have to be taken by a Xi (they are not in lb(T)). 

In the particular case of unit capacity networks, a flow is any set E' C E: any arc in 
E' is assigned 1 and the arcs in E\ E' are assigned 0. A feasible flow from s to t in Gc is 
a subset Ef of E such that Vn S 7V\{s, t} the number of arcs of Ef entering n is equal to 
the number of arcs of Ef going out of n, that is, \{(n' , n) g _E/}| = |{(n, n") e The 
value of the flow Ef from s to £, denoted val(Ef, s, t), is val(Ef,s, t) — \{n \ (s, n) £ 
A maximum flow from s to £ in Gc is a feasible flow -Em such that there does not exist 
a feasible flow Ef, with val(Ef,s,t) > val(EM, s,t). A maximum flow for the network 
of Fig. [T] is given in Fig. [5] By construction a feasible flow cannot have a value greater 
than |A^i| and cannot contain two arcs entering a node Xi from A^. Hence, we can define 
a function ip linking feasible flows and partial instantiations on the X^s. Given any 
feasible flow Ef from s to t in Gc, ^{Ef) — {(Xi,v) \ (v,Xi) £ Ef}. The maximum flow 

1 Note that in our presentation of the graph, the edges go from the nodes representing the values to 
the nodes representing the variables. This is the opposite to the direction used in the presentation of 
network flows for propagators of the AllDifferent or Gcc constraints |25II26I . 
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Figure 2: A maximum flow for the network of Fig. [T] Bold arcs are those that belong to 
the flow. Arcs are directed from left to right. 

in Fig. [5] corresponds to the instantiation X2 = 4,^3 = 3. The way Gc is built induces 
the following theorem. 

Theorem 1 Let Gc = (N, E) be the capacity network of a constraint C = Occurs([Ai, . . . , X n ],T). 

1. A value v in the domain D{Xi) for some i G [l..n] is HC iff there exists a flow Ef 
from s to t in Gc with val(Ef, s, t) = \N\\ and {v, Xi) G Ef 

2. If the Xi 's are HC, T is HC iff ub(T) C (J i D(Xi) 

Proof. (l-=>) Let / be a solution for C with (Xi,v) G /. Build the following flow H: 
Put (v,Xi) in H; yw G I[T],w ^ v, take a variable Xj such that {Xj,w) G / (we know 
there is at least one since I is solution), and put (w,Xj) in H; \fw' (£ I[T],w' 7^ v, add 
(«/, z w i) to H . Add to H the edges from s to Ni and from N2 to t so that we obtain a 
feasible flow. By construction, all w G Ni belong to an edge of H . So, val(H, s, t) = \Ni\ 
and H is a maximum flow with (v, Xi) G H. 

(1.4=) Let Em be a flow from s to t in Gc with (v,Xi) G Em and vol {Em , s,t) 
= \Ni\. By construction of Gc, we are guaranteed that all nodes in Ni belong to an arc 
in^Mn(iViXiV2), and that for every value w G lb{T), {y \ {w,y) G E} C {xi \ i G [l..n]}. 
Thus, for each w G lb{T), 3Xj \ {Xj, w) G ip(Em)- Hence, any extension of ip{Em) where 
each unassigned Xj takes any value in D{Xj) and T = lb(T) is a solution of C with 
Xi = v . 

(2.=>) If T is HC, all values in ub(T) appear in at least one solution tuple. Since C 
ensures that T C ub{T) cannot contain a value appearing in none of the D{Xi). 

(2.<=) Let ub{T) C |J. D(Xi). Since all X^s are HC, we know that each value v in 
|J 4 D(Xi) is taken by some Xi in at least one solution tuple /. Build the tuple I' so that 
I'[Xi] = I[Xi] for each i G [l..n] and I'[T] = I[T] U {v}. V is still solution of C. So, 
ub(T) is as tight as it can be wrt HC. In addition, since all Xj's are HC, this means that 
in every solution tuple /, for each v G lb{T) there exists i such that I[Xi] = v. So, lb{T) 
is HC. □ 

Following Theorem [1] we need a way to check which edges belong to a maximum 
flow. Residual graphs are useful for this task. Given a unit capacity network Gc and 
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x 3 

Figure 3: Residual graph obtained from the network in Fig. [T]and the maximum flow in 

Fig. m 

a maximal flow Em from s to t in Gc, the residual graph Rq q {Em) = {N,Er) is the 
directed graph obtained from Gc by reversing all arcs belonging to the maximum flow 
E M \ that is, E R = {(x,y) 6 E\ E M } U {{y,x) \ (x, y) £ En E M }- Given the network 
Gc of Fig. Q] and the maximum flow Em of Fig. [21 Rg c (Em) is depicted in Fig. [5] 
Given a maximum flow Em from s to t in Gc, given (x, y) G Ni x N2 D E \ Em, there 
exists a maximum flow containing (x, y) iff (x, y) belongs to a cycle in Rq c (Em) [29) . 
Furthermore, finding all the arcs (x, y) that do not belong to a cycle in a graph can be 
performed by building the strongly connected components of the graph. We see in Fig. 
[3]that the arcs (l,xi) and (2,xi) belong to a cycle. So, they belong to some maximum 
flow and {X\, 1) and (X\, 2) are hybrid consistent. (2, X2) does not belong to any cycle. 
So, (X 2 ,2) is not HC. 

4.1.2 Using the network flow for achieving HC on OCCURS 

We now have all the tools for achieving HC on any OCCURS constraint. We first build Gc- 
We compute a maximum flow Em from s to t in Gc; if vciI(Em, s,t) < \N\\, we fail. Oth- 
erwise we compute Rc c (Em), build the strongly connected components in Rq c (Em), 
and remove from D(Xi) any value v such that (i>,Xj) belongs to neither Em nor to a 
strongly connected component in Rg c {Em)- Finally, we set ub(T) to ub(T) n D{X{). 
Following Theorem Q] and properties of residual graphs, this algorithm enforces HC on 
OccURS([Xi,..,X n ],T). 

Complexity. Building Gc is in 0(nd) where d is the maximum domain size. We need 
then to find a maximum flow Em in Gc- This can be done in two sub-steps. First, we 
use the arc (v,z v ) for each v ^ lb(T) (in 0(\{J i D(X i )\)). Afterwards, we compute a 
maximum flow on the subgraph composed of all paths traversing nodes w with w € lb(T) 
(because there is no arc (w,z w ) in Gc for such w). The complexity of finding a maxi- 
mum flow in a unit capacity network is in 0(y/k • e) if k is the number of nodes and e 
the number of edges. This gives a complexity in 0{<J\lb(T)\ ■ \lb(T) \ ■ n) for this second 
sub-step. Building the residual graph and computing the strongly connected components 
is in 0(nd). Extracting the HC domains for the X^s is direct. There remains to compute 
BC on T, which takes Q(nd). Therefore, the total complexity is in 0(nd + n ■ |^6(T)| 3 / 2 ). 
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Algorithm 1: Hybrid consistency on RANGE 

procedure Propag-Range([Xi, . . . ,X n ],S, T); 

1 Introduce the set of integer variables Y = {Y; | i £ ub(S)}, 
with D(Yi) = D(Xi) U {dummy}; 

2 Achieve hybrid consistency on the constraint Occurs(Y, T); 

3 Achieve hybrid consistency on the constraints i £ S <-> 7, 6 T, for all £ Y; 

4 Achieve GAC on the constraints (Y; = dummy) V (Y; = -Xj), for all Yi & Y; 



Incrementality. In constraint solvers, constraints are usually maintained in a locally 
consistent state after each modification (restriction) of the domains of the variables. It 
is thus interesting to consider the total complexity of maintaining HC on OCCURS after 
an arbitrary number of restrictions on the domains (values removed from D(Xi) and 
ub(T), or added to lb(T)) as we descend a branch of a backtracking search tree. Whereas 
some constraints are completely incremental (i.e., the total complexity after any number 
of restrictions is the same as the complexity of one propagation), this is not the case 
for constraints based on flow techniques like AllDifferent or Gcc [25l [26]. They 
potentially require the computation of a new maximum flow after each modification. 
Restoring a maximum flow from one that lost p edges is in 0(p ■ e). If values are removed 
one by one (nd possible times), and if each removal affects the current maximum flow, 
the overall complexity over a sequence of restrictions on AYs, S, T, is in 0(n 2 d 2 ). 

4.2 Hybrid consistency on RANGE 

Enforcing HC on Range([Xl, . . . , X n ], S, T) can be done by decomposing it as an OCCURS 
constraint on new variables Yi and some channelling constraints (|16j) linking T and the 
Yj's to S and the AVs. Interestingly, we do not need to maintain HC on the decomposi- 
tion but just need to propagate the constraints in one pass. 

The algorithm Propag-Range, enforcing HC on the Range constraint, is presented in 
Algorithm [T] In line[IJ a special encoding is built, where a Yi is introduced for each Xi 
with index in ub(S). The domain of a Yi is the same as that of Xi plus a dummy value. 
The dummy value works as a flag. If OCCURS prunes it from D(Yi) this means that Yi is 
necessary in OCCURS to cover lb(T). Then, Xi is also necessary to cover lb(T) in Range. 
In line [U HC on OCCURS removes a value from a Yi each time it contains other values 
that are necessary to cover lb(T) in every solution tuple. HC also removes values from 
ub(T) that cannot be covered by any Yi in a solution. Line [T] updates the bounds of S 
and the domain of Y^s. Finally, in lineHl the channelling constraints between Yi and Xi 
propagate removals on Xi for each i which belongs to S in all solutions. 

Theorem 2 The algorithm Propag-Range is a correct algorithm for enforcing HC on 
RANGE, that runs in 0(nd + n ■ \lb(T)\ 3 / 2 ) time, where d is the maximal size of Xi 
domains. 

Proof. Soundness. A value v is removed from D(Xi) in line [T] if it is removed from Yi 
together with dummy in lines [T] or [1] If a value v is removed from Yi in line[Tl this means 
that any tuple on variables in Y covering lb(T) requires that Yi takes a value from D(Yi) 
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other than v. So, we cannot find a solution of Range in which Xj = v since lb(T) must 
be covered as well. A value v is removed from D{Yi) in lineUJif i £ lb(S) and v £ ub(T). 
In this case, Range cannot be satisfied by a tuple where Xj — v. If a value v is removed 
from ub(T) in line [JJ none of the tuples of values for variables in Y covering lb(T) can 
cover v as well. Since variables in Y duplicate variables Xj with index in ub(S), there is 
no hope to satisfy Range if v is in T. Note that ub(T) cannot be modified in line [1] since 
Y contains only variables Yi for which i was in ub(S). If a value v is added to lb(T) in 
line[TJ this is because there exists i in lb(S) such that D(Yi) n ub(T) — {v}. Hence, v is 
necessarily in T in all solutions of Range. An index i can be removed from ub(S) only 
in line[TJ This happens when the domain of Yi does not intersect ub(T). In such a case, 
this is evident that a tuple where i £ S could not satisfy Range since Xi could not take 
a value in T. Finally, if an index i is added to lb(S) in linc[T] this is because D(Yi) is 
included in lb(T), which means that the dummy value has been removed from D(Yi) in 
line[TJ This means that Yi takes a value from lb(T) in all solutions of OCCURS. Xi also 
has to take a value from lb(T) in all solutions of Range. 

Completeness Suppose that a value v is not pruned from D{Xi) after line [1] of Propag- 
Range. If Yi £ Y , we know that after line [JJ there was an instantiation I on Y and 
T, solution of OCCURS with I[Yi] — v or with lj = dummy (thanks to the channelling 
constraints in line[TJ). We can build the tuple I' on X\, ..X n ,S,T where Xj takes value 
v, every Xj with j € ub(S) and /[Y^-] € J[T] takes J[Yj*], and the remaining Xj's take 
any value in their domain. T is set to 7[T] plus the values taken by Xj's with j £ lb(S). 
These values are in ub(T) thanks to line [TJ Finally, S is set to lb(S) plus the indices of 
the Yj's with I[Yj] £ I[T], These indices are in ub(S) since the only j's removed from 
ub(S) in lineQ]are such that D(Yj) n ub(T) — 0, which prevents /[ij] from taking a value 
in I[T}. Thus 7' is a solution of Range with /'[Xj] = v. We have proved that the Xj's 
are hybrid consistent after Propag-Range. 

Suppose a value i £ ub(S) after line[TJ Thanks to constraint in line [JJ we know there 
exists v in D(Yi) n ub(T), and so, v £ D{Xj) n ub(T). Now, X, is hybrid consistent 
after line[T] Thus Xi — v belongs to a solution of Range. If we modify this solution by 
putting i in S and v in T (if not already there), we keep a solution. 

Completeness on lb(S), lb(T) and ub(T) is proved in a similar way. 
Complexity. The important thing to notice in Propag-Range is that constraints in lines 
[JJ[TJare propagated in sequence. Thus, OCCURS is propagated only once, for a complexity 
in 0(nd + n- \lb(T)\ 3 / 2 ). Lines[TJ[TJ andUJare in 0(nd). Thus, the complexity of Propag- 
Range is in 0(nd + n ■ \lb(T)\ 3 ^ 2 ). This reduces to linear time complexity when lb(T) is 
empty. 

Incrementality. The overall complexity over a sequence of restrictions on Xj's, S and T 
is in 0(n 2 d 2 ). (See incrementality of OCCURS in Section E3~T1 ) □ 

Note that the Range constraint can be decomposed using the Gcc constraint. How- 
ever, propagation on such a decomposition is in 0(n 2 d+n 2 66 ) time complexity (see |24) ) . 
Propag-Range is thus significantly cheaper. 
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5 Propagating the ROOTS constraint 



We now give a thorough theoretical analysis of the ROOTS constraint. In Section [O] 
we provide a proof that enforcing HC on Roots is NP-hard in general. Section 15.21 
presents a decomposition of the Roots constraint that permits us to propagate the 
ROOTS constraint partially in linear time. Section 15.31 shows that in many cases this 
decomposition does not destroy the global nature of the ROOTS constraint as enforcing 
HC on the decomposition achieves HC on the Roots constraint. Section 15.41 shows 
that we can obtain BC on the Roots constraint by enforcing BC on its decomposition. 
Finally, we provide some implementation details in Section [5.51 

5.1 Complete propagation 

Unfortunately, propagating the ROOTS constraint completely is intractable in general. 
Whilst we made this claim in [10], a proof has not yet been published. For this reason, 
we give one here. 

Theorem 3 Enforcing HC on the Roots constraint is NP-hard. 

Proof. We transform 3Sat into the problem of the existence of a solution for Roots. 
Finding a hybrid support is thus NP-hard. Hence enforcing HC on Roots is NP-hard. 
Let (p = {ci, . . . , c m } be a 3CNF on the Boolean variables x\, . . . ,x n . We build the con- 
straint Roots([Xi, . . . , X n + m ], S,T) as follows. Each Boolean variable Xi is represented 
by the variable X, with domain D(Xt) = {i,— i}. Each clause c p — Xi V ~^Xj V Xk is 
represented by the variable X n+p with domain D(X n+p ) — {i,—j,k}. We build S and 
T in such a way that it is impossible for both the index i of a Boolean variable Xi and 
its complement —i to belong to T. We set lb(T) = and ub(T) = U™=i{*'~ *}> an< ^ 
lb(S) = ub(S) = {n + 1, . . . , n + m}. An interpretation M on the Boolean variables 
X\, . . . , x n is a model of <p iff the tuple r in which t[JQ] = i iff M[xj] = can be extended 
to a solution of ROOTS. (This extension puts in T value i iff M[:Ej] = 1 and assigns X n+p 
with the value corresponding to the literal satisfying c p in M.) □ 

We thus have to look for a lesser level of consistency for Roots or for particular cases 
on which HC is polynomial. We will show that bound consistency is tractable and that, 
under conditions often met in practice (e.g. one of the last two arguments of ROOTS is 
ground), enforcing HC is also. 

5.2 A decomposition of Roots 

To show that Roots can be propagated tractably, we will give a straightforward decom- 
position into ternary constraints that can be propagated in linear time. This decompo- 
sition does not destroy the global nature of the ROOTS constraint since enforcing HC on 
the decomposition will, in many cases, achieve HC on the original Roots constraint, and 
since in all cases, enforcing BC on the decomposition achieves BC on the original ROOTS 
constraint. Given Roots([Xi, .., X n ], S, T), we decompose it into the implications: 

i G S -> Xi eT 
l,eT -> ieS 
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where i £ [l..n]. We have to be careful how we implement such a decomposition in a 
constraint solver. First, some solvers will not achieve HC on such constraints (see Sec 
15.51 for more details). Second, we need an efficient algorithm to be able to propagate the 
decomposition in linear time. As we explain in more detail in Sec 15.51 a constraint solver 
could easily take quadratic time if it is not incremental. 

We first show that this decomposition prevents us from propagating the ROOTS con- 
straint completely. However, this is to be expected as propagating Roots completely is 
NP-hard and this decomposition is linear to propagate. In addition, as we later show, in 
many circumstances met in practice, the decomposition does not in fact hinder propaga- 
tion. 

Theorem 4 HC on Roots([Xl, .., X n ], S, T) is strictly stronger than HC on i £ S — > 
Xi £ T, and Xi S T — > i € S for all i £ [l..n]. 

Proof. Consider X x £ {1,2}, X 2 £ {3,4}, X 3 £ {1,3}, X 4 G {2,3}, lb(S) = ub(S) = 
{3, 4}, lb(T) = 0, and ub(T) = {1,2, 3, 4}. The decomposition is HC. However, enforcing 
HC on Roots will prune 3 from D(X 2 ). □ 

In fact, enforcing HC on the decomposition achieves a level of consistency between 
BC and HC on the original Roots constraint. Consider X\ £ {1,2,3}, X 2 £ {1,2,3}, 
lb(S) = ub{S) = {1,2}, lb(T) = {}, and ub{T) = {1,3}. The ROOTS constraint is BC. 
However, enforcing HC on the decomposition will remove 2 from the domains of Xi and 
X 2 . In the next section, we identify exactly when the decomposition achieves HC on 
Roots. 

5.3 Some special cases 

Many of the counting and occurrence constraints do not use the ROOTS constraint in its 
more general form, but have some restrictions on the variables S, T or Xj's. For example, 
it is often the case that T or S are ground. We select four important cases that cover 
many of these uses of ROOTS and show that enforcing HC on Roots is then tractable. 

CI. Mi e lb(S),D(Xi) C lb(T) 

C2. Vi i ub(S), D{Xi) n ub(T) = 

C3. Xt, ..,X n are ground 

C4. T is ground 

We will show that in any of these cases, we can achieve HC on Roots simply by propa- 
gating the decomposition. 

Theorem 5 If one of the conditions CI to CA holds, then enforcing HC on i £ S — > 
Xi £ T, and X t £ T — > i £ S for all i £ [l..n] achieves HC on Roots([Xl, .., X n ],S, T). 

Proof. Our proof will exploit the following properties that are guaranteed to hold when 
we have enforced HC on the decomposition. 
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PI if D(Xi) C lb(T) then i e J6(S) 

P2 if D(X l ) n u6(T) = then i £ u6(S) 

P3 if i g J6(S) then £>(X;) C «6(T) 

P4 if i $ ub(S) then D(X t ) n J6(T) = 

P5 if D(JSQ) = {«} and i E lb(S) then v E lb(T) 

P6 if D(Xi) = {v} and i ub(S) then w ^ ub(T) 

P7 if i is added to lb(S) by the constraint X r E T -> i e 5 then C Z6(T) 

P8 if i is deleted from by the constraint i E S ^ X { E T then L>(X;) n ub(T) = 

Soundness. Immediate. 

Completeness. We assume that one of the conditions CI — C4 holds and the decom- 
position is HC. We will first prove that the Roots constraint is satisfiable. Then, we 
will prove that, for any Xi, all the values in D(X{) belong to a solution of Roots, and 
that the bounds on S and T are as tight as possible. 

We prove that the ROOTS constraint is satisfiable. Suppose that one of the conditions 
CI — C4 holds and that the decomposition is HC. Build the following tuple t of values 
for the Xi, S, and T. Initialise t[S] and t[T] with lb(S) and lb(T) respectively. Now, let 
us consider the four conditions separately. 

(CI) For each i E t[S], choose any value v in D{Xi) for tLYj]. From the assumption 
and from property P7 we deduce that v is in lb(T), and so in r[T]. For each other i, 
assign Xi with any value in D{Xi) \ lb(T). (This set is not empty thanks to property 
PI.) t obviously satisfies Roots. 

(C2) For each i E t[S], choose any value in D{Xi) for r[Xj]. By construction such a 
value is in ub(T) thanks to property P3. If necessary, add r[Xi) to t[T}. For each other 
i E ub(S), assign Xi with any value in D(Xi)\r[T] if possible. Otherwise assign Xi with 
any value in D(Xi) and add i to t[S]. For each i ^ ub(S), assign Xi any value from its 
domain. By assumption and by property P8 we know that D(Xi) n ub(T) = 0. Thus, r 
satisfies Roots. 

(C3) r[Xi] is already assigned for all X,. For each i E t[S], property P5 tells us that 
r[Xi] is in r[T], and for each i ^ lb(S), property PI tells us that r[Xi] is outside lb(T). 
t satisfies Roots. 

(C4) For each i E t[S] choose any value v in D(Xi) for r[Xi]. Property P3 tells us 
v E ub(T). By assumption, v is thus in t[T]. For each i outside ub(S), assign Xi with 
any value v in D(Xi). (v is outside t[T] by assumption and property P4). For each other 
i, assign Xi with any value in D(Xi) and update t[S] if necessary, r satisfies Roots. 

We have proved that the ROOTS constraint has a solution. We now prove that for 
any value in ub(S) or in ub{T) or in D(Xi) for any Xi, we can transform the arbitrary 
solution of ROOTS into a solution that contains that value. Similarly, for any value not 
in lb(S) or not in lb(T), we can transform the arbitrary solution of Roots into a solution 
that does not contain that value. 

Let us prove that lb(T) is tight. Suppose the tuple r is a solution of the Roots 
constraint. Let v E' lb(T) and v E t[T]. We show that there exists a solution with 
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v $l t[T]. (Remark that this case is irrelevant to condition C4.) We remove v from r[T]. 
For each i £ lb(S) such that r[Xi] = v we remove i from t[S]. With CI we are sure that 
none of the i in lb(S) have r[Xi] = v, thanks to property P7 and the fact that v £ lb(T). 
With C3 we are sure that none of the i in lb(S) have r[Xi] — v, thanks to property P5 
and the fact that v £ lb(T). There remains to check C2. For each i E lb(S), we know 
that 3v' 7^ v,v' E D(Xi) fl ub(T), thanks to properties P3 and P5. We set Xi to v' in 
t, we add v' to t[T) and add all k with r[Xfe] = v' to t[S]. We are sure that k E ub(S) 
because v' E ub(T) plus condition C2 and property P8. 

Completeness on ub(T), lb(S), ub(S) and X^s arc shown with similar proofs. Let 
v E ub(T) \ t[T]. (Again C4 is irrelevant.) We show that there exists a solution with 
v € t[T]. Add v to r[T] and for each i E u6(S), if r[A 4 ] = u, put i in r[5]. C2 is solved 
thanks to property P8 and the fact that v E ub(T). C3 is solved thanks to property 
P6 and the fact that v E ub{T). There remains to check CI. For each i £ ub(S) and 
r[Xi] = v, we know that 3v' ^ v,v' e D{Xi) \ lb(T) (thanks to properties P4 and P6). 
We set Xi to v' in r and remove v' from t[T]. Each k with r[Xk] = v' is removed 
from t[S], and this is possible because we are in condition CI, v' £ lb(T), and thanks to 
property P7. 

Let v e D(Xi) and r[Xi] = v',v' ^ v. (C3 is irrelevant.) Assign v to Xi in r. If 
both d and u' or none of them are in t[T], we are done. There remain two cases. First, 
if v e t[T] and w' ^ r[T], the two alternatives to satisfy ROOTS are to add i in t[S] or 
to remove v from r[T]. If i € ub(S), we add i to rfS 1 ] and we are done. If i £ ub(S), we 
know that v £ lb(T) thanks to property P4. So, v is removed from t[T] and we are sure 
that the Xj's can be updated consistently for the same reason as in the proof of lb(T). 
Second, if v £ t[T] and v' € t[T], the two alternatives to satisfy Roots are to remove i 
from t[S] or to add v to t[T]. If i ^ lb(S), we remove i from rfS 1 ] and we are done. If 
i € lb(S), we know that v € ub(T) thanks to property P3. So, w is added to t[T] and we 
are sure that the Xj's can be updated consistently for the same reason as in the proof 
of ub{T)\r[T]. 

Let i £ lb(S) and i e t[S]. We show that there exists a solution with i £ t[S]. We 
remove i from t[S]. Thanks to property PI, we know that D(Xi) % lb(T). So, we set 
Xi to a value v' E D(Xi) \ lb(T). With C4 we are done because we are sure v' £ t[T}. 
With conditions CI, C2, and C3, if v' E t[T], we remove it from t[T] and we are sure 
that the Xj's can be updated consistently for the same reason as in the proof of lb(T). 

Let i E ub(S) \ t[S]. Wc show that there exists a solution with i E t[S]. We add i 
to t[S}. Thanks to property P2, wc know that D(Xi) n ub(T) ^ 0. So, wc set X t to a 
value v' E D(Xi)r\ub(T). With condition C4 we are done because we are sure v' E t[T]. 
With conditions CI, C2, and C3, if v' t[T], we add it to r[T] and we are sure that the 
Xj's can be updated consistently for the same reason as in the proof of ub(T) \ t[T]. □ 

5.4 Bound consistency 

In addition to being able to enforce HC on Roots in some special cases, enforcing HC 
on the decomposition always enforces a level of consistency at least as strong as BC. In 
fact, in any situation (even those where enforcing HC is intractable), enforcing BC on 
the decomposition enforces BC on the Roots constraint. 
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Theorem 6 Enforcing BC on i e S — > X t e T, and X % E T — > % e S for all i e [l..n] 
achieves BC on Roots ([Xi, ..,X n ],S,T), 

Proof. Soundness. Immediate. 

Completeness. The proof follows the same structure as that in Theorem [5] We relax the 
properties P1-P4 into properties Pl'-P4'. 

PI' if [min(Xi),max(Xi)} C lb(T) then i e lb(S) 

P2' if [min(Xi), max(Xi)} n ub(T) = then i ub(S) 

P3' if i € lb(S) then the bounds of Xi are included in ub{T) 

P4' if i ub(S) then the bounds of Xi are outside lb(T) 

Let us prove that lb(T) and ub(T) are tight. Let o be the total ordering on D = 
U, £>(Xj) Uu&(T). Build the tuples a and r as follows: For each v € lb(T): put v in er[T] 
and t[T]. For each i> e ub(T) \ lb(T), following o, do: put v in cr[T] or t[T] alternately. 
For each i <E lb(S), P3' guarantees that both mm(Xj) and max(Xi) are in ub(T). By 
construction of cr[T] (and r[T]) with alternation of values, if min(Xi) ^ max(Xi), we 
are sure that there exists a value in <r[T] (in r[T]) between min(Xi) and maar(Xj). In 
the case |£)(Jfj)| = 1, P5 guarantees that the only value is in cr[T] (in r[T]). Thus, we 
assign X, in a (in t) with such a value in <r[T] (in r[T]). For each i ub(S), we assign 
Xi in a with a value in [min(Xi), max(Xi)] \o~[T] (the same for r). We know that such a 
value exists with the same reasoning as for i S lb(S) on alternation of values, and thanks 
to P4' and P6. We complete a and r by building a[S] and t[>S] consistently with the 
assignments of Xi and T. The resulting tuples satisfy Roots. From this we deduce that 
lb(T) and ub(T) are BC as all values in ub(T) \ lb(T) are either in a or in r, but not 
both. 

We show that the Xi are BC. Take any Xi and its lower bound min(Xi). If i € Z6(5) 
we know that min(Xi) is in T either in cr or in t thanks to P3' and by construction of 
a and r. We assign min(Xi) to in the relevant tuple. This remains a solution of 
Roots. If i ^ ub(S), we know that min(Xi) is outside T either in cr or in r thanks to 
P4' and by construction of a and t. We assign min(Xi) to Xj in the relevant tuple. 
This remains a solution of ROOTS. If i G ub(S) \ lb(S), assign Xi to min(Xi) in ct. If 
min(Xi) ^ er[T], remove i from a[S] else add i to a[S]. The tuple obtained is a solution 
of Roots using the lower bound of JQ. By the same reasoning, we show that the upper 
bound of Xi is BC also, and therefore, all Xi's are BC. 

We prove that lb(S) and ub(S) are BC with similar proofs. Let us show that ub(S) 
is BC. Take any Xi with i S ub(S) and i ^ cIS"]- Since Xi was assigned any value from 
[min(Xi),max(Xi)] when cr was built, and since we know that [min(Xi),max(Xi)] 
ub(T) ^ thanks to P2', we can modify a by assigning Xi a value in ub(T), putting the 
value in T if not already there, and adding % into S. The tuple obtained satisfies ROOTS. 
So ub(S) is BC. 

There remains to show that lb(S) is BC. Thanks to PI', we know that values i 6 
ub{S)\lb{S) are such that [min(Xi),max(Xi)]\lb(T) ^ 0. Taket; G [rran(JQ), max(Xj)]\ 
lb(T). Thus, either a or r is such that v ^ T. Take the corresponding tuple, assign JQ 
to u and remove z from 5. The modified tuple is still a solution of Roots and lb(S) is 
BC. □ 
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5.5 Implementation details 

This decomposition of the ROOTS constraint can be implemented in many solvers using 
disjunctions of membership and negated membership constraints: or(member(i, S), notmember(JQ, T)) 
and or (notmember(i, S), member (JQ, T)). However, this requires a little care. Unfortu- 
nately, some existing solvers (like Hog Solver) may not achieve HC on such disjunctions 
of primitives. For instance, the negated membership constraint notmember(A"j, T) may 
be activated only if Xi is instantiated with a value of T (whereas it should be as soon as 
D{Xj) C lb(T)). We have to ensure that the solver wakes up when it should to ensure 
we achieve HC. As we explain in the complexity proof, we also have to be careful that 
the solver does not wake up too often or we will lose the optimal 0(nd) time complexity 
which can be achieved. 

Theorem 7 It is possible to enforce HC (or BC) on the decomposition of ROOTS ([Xi, .., X n ], S, T) 
in 0(nd) time, where d = max(yi.\D(Xi)\, \ub(T)\). 

Proof. The decomposition of ROOTS is composed of 2n constraints. To obtain an overall 
complexity in 0(nd) 7 the total amount of work spent propagating each of these constraints 
must be in 0{d) time. 

First, it is necessary that each of the 2n constraints of the decomposition is not called 
for propagation more than d times. Since S can be modified up to n times (n can be 
larger than d) it is important that not all constraints are called for propagation at each 
change in lb(S) or ub(S). By implementing 'propagating events' as described in [20ll30| . 
we can ensure that when a value i is added to lb(S) or removed from ub(S), constraints 
j G S — ► Xj G T and Xj G T — > j G S, j ^ i, arc not called for propagation. 

Second, we show that enforcing HC on constraint i G S — * Xi G T is in 0(d) time. 
Testing the precondition (does i belong to lb(S)l) is constant time. If true, removing 
from D{Xi) all values not in ub(T) is in 0(d) time and updating lb(T) (if |D(Xj)| = 1) is 
constant time. Testing that the postcondition is false (is D(Xi) disjoint from ub(T)l) is 
in 0(d) time. If false, updating ub(S) is constant time. Thus HC on i G S — > Xi G T is in 
0(d) time. Enforcing HC on Xi G T — ► i G S is in 0(d) time as well because testing the 
precondition (D(Xi) C lb(T)l) is in 0(d) time, updating lb(S) is constant time, testing 
that the postcondition is false (i £ ub(S)?) is constant time, and removing from D(Xi) 
all values in lb(T) is in O(d) time and updating ub(T) (if = 1) is constant time. 

When T is modified, all constraints are potentially concerned. Since T can be modified 
up to d times, we can have d calls of the propagation in 0(d) time for each of the 2n 
constraints. It is thus important that the propagation of the 2n constraints is incremental 
to avoid an 0(nd 2 ) overall complexity. An algorithm for i E S — > Xi G T is incremental 
if the complexity of calling the propagation of the constraint i S S — > Xj G T up to d 
times (once for each change in T or D(Xi)) is the same as propagating the constraint 
once. This can be achieved by an AC2001-likc algorithm that stores the last value found 
in D(Xi) n ub(T), which is a witness that the postcondition can be true. (Similarly, the 
last value found in D(Xi) \ lb(T) is a witness that the precondition of the constraint 
Xi G T —> i G S can be false.) Finally, each time lb(T) (resp. ub(T)) is modified, 
D(Xi) must be updated for each i outside ub(S) (resp. inside lb(S)). If the propagation 
mechanism of the solver provides the values that have been added to lb(T) or removed 
from ub(T) to the propagator of the 2n constraints (as described in [33 ), updating a 
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given D(Xi) has a total complexity in 0(d) time for the d possible changes in T. The 
proof that BC can also be enforced in linear time follows a similar argument. □ 

6 A catalog of decompositions using RANGE and ROOTS 

We have shown how to propagate the Range and ROOTS constraints. Specification of 
counting and occurrence constraints using Range and Roots will thus be executable. 
Range and ROOTS permit us to decompose counting and occurrence global constraints 
into more primitive constraints, each of which having an associated polynomial propa- 
gation algorithm. In some cases, such decomposition does not hinder propagation. In 
other cases, enforcing local consistency on the global constraint is intractable, and de- 
composition is one method to obtain a polynomial propagation algorithm [T31 [23 HI] ■ 

In a technical report [9], we present a catalog containing over 70 global constraints 
from [5] specified with the help of the Range and ROOTS constraints. Here we present a 
few of the more important constraints. In the subsequent five subsections, we list some 
counting and occurrence constraints which can be specified using Range constraints, 
using Roots constraints, and using both Range and ROOTS constraints. We also show 
that Range and Roots can be used to specify open global constraints, a new kind of 
global constraints introduced recently. We finally include problem domains other than 
counting and occurrence to illustrate the wide range of global constraints expressible in 
terms of Range and Roots. 

6.1 Applications of Range constraint 

Range constraints are often useful to specify constraints on the values used by a sequence 
of variables. 

6.1.1 All different 

The AllDifferent constraint forces a sequence of variables to take different values 
from each other. Such a constraint is useful in a wide range of problems (e.g. allocation 
of activities to different slots in a time-tabling problem) . It can be propagated efficiently 
[25) . It can also be decomposed with a single Range constraint: 

AllDifferent([Xl, .., X n ]) iff 
RANGE([A 1 ,..,A„],{l,..,n},T) A |T| = n 

A special but nevertheless important case of this constraint is the Permutation 
constraint. This is an AllDifferent constraint where we additionally know R, the set 
of values to be taken. That is, the sequence of variables is a permutation of the values 
in R where \R\ = n. This also can be decomposed using a single Range constraint: 

Permutation^^!, .., X n ], R) iff 

RANGE([Xi,..,X„],{l,..,n},i?) 
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Such a decomposition of the Permutation constraint obviously does not hinder 
propagation. However, decomposition of AllDifferent into a Range constraint does. 
This example illustrates that, whilst many global constraints can be expressed in terms 
of Range and Roots, there are some global constraints like AllDifferent for which 
it is worth developing specialised propagation algorithms. Nevertheless, Range and 
Roots provide a means of propagation for such constraints in the absence of specialised 
algorithms. They can also enhance the existing propagators. For instance, HC on the 
Range decomposition is incomparable to AC on the decomposition of AllDifferent 
which uses a clique of binary inequality constraints. Thus, we may be able to obtain 
more pruning by using both decompositions. 

Theorem 8 (1) GAC on Permutation is equivalent to HC on the decomposition with 
Range. (2) GAC on AllDifferent is stronger than HC on the decomposition with 
Range. (3) AC on the decomposition of AllDifferent into binary inequalities is 
incomparable to HC on the decomposition with RANGE. 

Proof: (1) Permutation can be encoded as a single Range. Moreover, since R is 
fixed, HC is equivalent to AC. (2) Consider X t , X 2 G {1,2}, X 3 S {1,2,3,4}, and 
{1,2} C T C {1,2,3,4}. Then RANGE ( [X\ , X 2 , X 3 ] , { 1 , 2, 3} , T) and \T\ = 3 are both 
HC, but AllDifferent([Xi,X 2 ,X 3 ]) is not GAC. (3) Consider X u X 2 € {1,2}, X 3 e 
{1,2,3}, and T = {1,2,3}. Then X 1 ^ X 2 , X 1 ^ X 3 and X 2 # X 3 are AC but 
Range([Xi,X 2 ,X 3 ],{1,2,3},T) is not HC. Consider X x , X 2 e {1,2,3,4}, X 3 e {2}, 
and {2} C T C {1,2,3,4}. Then RANGE([Xi, X 2 , X 3 ], {1, 2, 3}, T) and \T\ = 3 are HC. 
But Xi ^ X 3 and X 2 ^ X 3 are not AC. □ 

6.1.2 Disjoint 

We may require that two sequences of variables be disjoint (i.e. have no value in common). 
For instance, two sequences of tasks sharing the same resource might be required to 
be disjoint in time. The DISJOINT ([Xi, .., X n ], [Yi, .., Y m \) constraint introduced in 2 
ensures X{ ^ Yj for any i and j. We prove here that we cannot expect to enforce GAC 
on such a constraint as it is NP-hard to do so in general. 

Theorem 9 Enforcing GAC on Disjoint is NP-hard. 

Proof: We reduce 3-SAT to the problem of deciding if a Disjoint constraint has any 
satisfying assignment. Finding support is therefore NP-hard. Consider a formula ip 
with n variables and m clauses. For each Boolean variable x, we let X x € {x, ^x} and 
Yj € {x, ->y, z} where the jth clause in tp is x V V z. If (p has a model then the 
Disjoint constraint has a satisfying assignment in which the X x take the literals false 
in this model. □ 
One way to propagate a Disjoint constraint is to decompose it into two Range 
constraints: 

DisjoiNT([x 1 ,..,x n ],[y 1 ,..,y m ]) iff 

Range([X 1 ,..,X„],{1,..,t 1 },5) a 

R A NGE([yi,..,y; n ],{i,..,m},T) a sdt = {} 
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Enforcing HC on this decomposition is polynomial. Decomposition thus offers a simple 
and promising method to propagate a Disjoint constraint. Not surprisingly, the de- 
composition hinders propagation (otherwise we would have a polynomial algorithm for 
a NP-hard problem). 

Theorem 10 GAC on Disjoint is stronger than HC on the decomposition. 

Proof: Consider X u Y x G {1,2}, X 2 ,Y 2 G {1,3}, Y 3 G {2,3} and {} C S*,T C {1,2,3}. 
Then Range([Xl,X 2 ], {1, 2}, S) and Range([Yi, Y 2 , Y 3 ], {1, 2, 3}, T) are HC, and SnT = 
{} is BC. However, enforcing GAC on DlSJOlNT([Xi, X 2 ], [Yi,Y 2 ,Y 3 ]) prunes 3 from X 2 
and 1 from both Y\ and Y 2 . □ 

6.1.3 Number of values 

The NValtje constraint is useful in a wide range of problems involving resources since 
it counts the number of distinct values used by a sequence of variables [551 [HI E]. As 
we saw in Section [3j N Value( {X\ X n ] , N) holds iff N = \{Xi | 1 < i < n}\. The 
AllDifferent constraint is a special case of the NValue constraint in which N = n. 
Unfortunately, it is NP-hard in general to enforce GAC on a NValue constraint [IB"] . 
However, there is an 0(n log(n)) algorithm to enforce a level of consistency similar to BC 
|3J. An alternative and even simpler way to implement this constraint is with a Range 
constraint: 



N Value( [Xl ,..,X n ],N) iff 
RANGE([X 1) ..,X„],{l,..,n},T) A \T\=N 

HC on this decomposition is incomparable to BC on the NValue constraint. 

Theorem 11 BC on NValue is incomparable to HC on the decomposition. 

Proof: Consider X U X 2 £ {1,2}, X 3 G {1,2,3,4}, N G {3} and {} C T C {1,2,3,4}. 
Then Range([Vi, X 2 , X 3 ], {1, 2, 3}, T) and |T| = N are both HC. However, enforcing 
BC on NValue([Vi, X 2 , X 3 ], N) prunes 1 and 2 from X 3 . 

Consider X 1 ,X 2 ,X 3 G {1,3} and N G {3}. Then NValue([Vi, X 2 , X 3 ], N) is BC. 
However, enforcing HC on Range([Vi, X 2 , X 3 ], {1, 2, 3}, T) makes{} CTC {1,3} which 
will cause |T| = 3 to fail. □ 

6.1.4 Uses 

In [5], propagation algorithms achieving GAC and BC are proposed for the UsedBy 
constraint. UsedBy([Xl, ..,X n ], [Yi, ..,Y m )) holds iff the multiset of values assigned to 
Yi, .., Y m is a subset of the multiset of values assigned to X\, .., X n . We now introduce a 
variant oftheUSEDBY constraint called the Uses constraint. Uses([Xl, ..,X n ], [Yi,, .., Y m \) 
holds iff the set of values assigned to Y%, .., Y m is a subset of the set of values assigned to 
Xi, .., X n , That is, UsedBy takes into account the number of times a value is used while 
Uses does not. Unlike the UsedBy constraint, enforcing GAC on Uses is NP-hard. 

Theorem 12 Enforcing GAC on Uses is NP-hard. 
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Proof: We reduce 3-SAT to the problem of deciding if a Uses constraint has a solution. 
Finding support is therefore NP-hard. Consider a formula tp with n Boolean variables 
and m clauses. For each Boolean variable x, we introduce a variable X x £ {x, —x}. For 
each clause Cj = x V V z, we introduce Yj £ {x, —y, z}. Then ip has a model iff the 
Uses constraint has a satisfying assignment, and x is true iff X x — x. □ 
One way to propagate a Uses constraint is to decompose it using Range constraints: 

UsEs([x 1 ,..,x„],[r 1 ,..,y m ]) iff 

RANGE([X 1 ,..,X n ],{l,..,n},T) A 
Range([Yi, .., Y m ], {1, .., m}, T') A T'CT 

Enforcing HC on this decomposition is polynomial. Not surprisingly, this hinders 
propagation (otherwise we would have a polynomial algorithm for a NP-hard problem) . 

Theorem 13 GAC on Uses is stronger than HC on the decomposition. 

Proof: Consider X 1 € {1,2,3,4}, X 2 € {1,2,3,5}, A 3 ,A 4 € {4,5,6}, Y 1 € {1,2}, 
Y 2 & {1,3}, and Y 3 E {2,3}. The decomposition is HC while GAC on Uses prunes 4 
from the domain of X\ and 5 from the domain of X 2 . □ 
Thus, decomposition is a simple method to obtain a polynomial propagation algo- 
rithm. 

6.2 Applications of Roots constraint 

Range constraints are often useful to specify constraints on the values used by a sequence 
of variables. Roots constraint, on the other hand, are useful to specify constraints on 
the variables taking particular values. 

6.2.1 Global cardinality 

The global cardinality constraint introduced in [26j constrains the number of times values 
are used. We consider a generalization in which the number of occurrences of a value 
may itself be an integer variable. More precisely, Gcc([A"!, ..,X n ], [dx, .., d m ], [6\, .., O m \) 
holds iff \{i | Xi = dj}\ = Oj for all j. Such a Gcc constraint can be decomposed into a 
set of Roots constraints: 

gcc([jci, ..,x n ], [di, .., d m ], px, .., cy ) iff 

Vi . RootsQXi, ..,.*„], A \Si\ = Oi 

Enforcing HC on these ROOTS constraints is polynomial since the sets {di} are ground 
(See Theorem [5]). Enforcing GAC on a generalised Gcc constraint is NP-hard, but we 
can enforce GAC on the Xi and BC on the Oj in polynomial time using a specialised 
algorithm [24j . This is more than is achieved by the decomposition. 

Theorem 14 GAC on the Xi and BC on the Oj of a Gcc constraint is stronger than 
HC on the decomposition using ROOTS constraints. 
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Proof: As sets are represented by their bounds, HC on the decomposition cannot prune 
more on the Oj than BC does on the Gcc. To show strictness, consider Xi, X2 <E {1, 2}, 
X 3 e {1,2,3}, di = i and Oi,0 2 ,0 3 S {0,1}. The decomposition is HC (with {} C 
Si,S 2 C {1,2,3} and {}CS 3 C {3}). However, enforcing GAC on the X { and BC on 
the Oj of the Gcc constraint will prune 1 and 2 from X3 and from 0\ , O2 and O3 . □ 
This illustrates another global constraint for which it is worth developing a specialised 
propagation algorithm. 

6.2.2 Among 

The Among constraint was introduced in CHIP to help model resource allocation prob- 
lems like car sequencing [4] . It counts the number of variables using values from a given 
set. Among([Ai,..,X„], [di,..,d m ],A0 holds iff TV = \{i \ Xi e {d u .., d m }}\. 

An alternative way to propagate the Among constraint is to decompose it using a 
Roots constraint: 

AmongQX 1 , .., X n ] , [d-t , .., d m ] , N) iff 
RoOTs([X 1 ,..,A„],5,{di,..,d m }) A \S\ = N 

It is polynomial to enforce HC on this case of the ROOTS constraint since the target 
set is ground. This decomposition also does not hinder propagation. It is therefore a 
potentially attractive method to implement the Among constraint. 

Theorem 15 GAC on Among is equivalent to HC on the decomposition using Roots. 

Proof: Suppose the decomposition into Roots(LYi, ..,X n ], S, {di, ..,d m }) and |5| = N 
is HC. The variables Xi divide into three categories: those whose domain only contains 
elements from {d l7 ..,d m } (at most min(JV) such variables); those whose domain do not 
contain any such elements (at most n — max(iV) such vars); those whose domain contains 
both elements from this set and from outside. Consider any value for a variable Xi in the 
first such category. To construct support for this value, we assign the remaining variables 
in the first category with values from {c?i, .., d m }. If the total number of assigned values 
is less than min(TV), we assign a sufficient number of variables from the second category 
with values from {di,..,d m } to bring up the count to min(TV). We then assign all the 
remaining unassigned Xj with values outside {d\, ..,d m }. Finally, we assign min(A r ) to 
N. Support can be constructed for variables in the other two categories in a similar way, 
as well as for any value of N between min(TV) and max(A). □ 

6.2.3 At most and at least 

The AtMost and AtLeast constraints are closely related. The AtMost constraint puts 
an upper bound on the number of variables using a particular value, whilst the AtLeast 
puts a lower bound. For instance, AtMostQXl, .., X n ],d, N) holds iff \{i \ Xi = d}\ < N. 
Both AtMost and AtLeast can be decomposed into Roots constraints. For example: 

ATMOST([Xi,..,X„],d,A0 iff 

RooTs([Xi,..,X n ],S,{d}) A \S\<N 
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Again it is polynomial to enforce HC on these cases of the Roots constraint, and the 
decomposition does not hinder propagation. Decomposition is therefore also a potential 
method to implement the AtMost and AtLeast constraints in case we do not have 
such constraints available in our constraint toolkit. 

Theorem 16 G AC on AtMost is equivalent to HC on the decomposition. Roots([Xi, .., X n ], S, {d}) 
and on \S\ < N . 

GAC on AtLeast is equivalent to HC on the decomposition. RootsQXi, ..,X n ], S, {d}) 
and on \S\ > N . 

Proof: The proof of the last theorem can be easily adapted to these two constraints. □ 

6.3 Applications of Range and Roots constraints 

Some global constraints need both Range and Roots constraints in their specifications. 
6.3.1 Assign and number of values 

In bin packing and knapsack problems, we may wish to assign both a value and a bin to 

each item, and place constraints on the values appearing in each bin. For instance, in the 

steel mill slab design problem (prob038 in CSPLib) , we assign colours and slabs to orders 

so that there are a limited number of colours on each slab. Assign&NValues([A 1 , .., X n ], [Yi, .., Y n ], N) 

holds iff \{Yi | Xi = j}\ < N for each j [2J. We cannot expect to enforce GAC on such a 

constraint as it is NP-hard to do so in general. 

Theorem 17 Enforcing GAC on Assign&NValues is NP-hard. 

Proof: Deciding if the constraint AtMostNValue has a solution is NP-complete, where 
AtMostNValue([Yi, ..,Y n ],N) holds iff |{Y, | 1 < i < n}\ < N 0(7]. The problem of 
the existence of a solution in this constraint is equivalent to the problem of the existence of 
a solution in ASSIGN&N VALUES ([Xi, ..,X n ], [Yi, ..,Y n ], N) where D(Xi) = {0},Vi<E l..n. 
Deciding whether Assign&NValues is thus NP-complete and enforcing GAC is NP- 
hard. □ 
Assign&NValues can be decomposed into a set of Range and Roots constraints: 



Assign&NValuesQA!, ..,X n ], [Yi, ,.,Y n ],N) iff 
Vj . RooTs([X u ..,X n ],S s ,{j}) A 

RangeQYx,..^], A \Tj\<N 

However, this decomposition hinders propagation. 

Theorem 18 GAC on AssiGN&N Values is stronger than HC on the decomposition. 

Proof: Consider N = 1, X 1: X 2 € {0}, Y 1 e {1,2}, Y 2 e {2,3}. HC on the decompo- 
sition enforces Sq = {1,2} and {} C To C {1,2,3} but no pruning on the Xi and Yj. 
However, enforcing GAC on AssIGN&NValues([Xi, X 2 ], [Yi, Y 2 ], N) prunes 1 from Yi 
and 3 from Y 2 . □ 
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6.3.2 Common 



A generalization of the Among and AllDifferent constraints introduced in [2] is the 
COMMON constraint. Common(A, M, [X 1 , ..,X n ], [Yi, ..,Y m ]) ensures N = \{i \ 3j, A., = 
Yj}\ and M = \{j \ 3i, Aj = Yj}\. That is, N variables in Xi take values in common with 
Yj and M variables in Yj takes values in common with Xi. We prove that we cannot 
expect to enforce GAC on such a constraint as it is NP-hard to do so in general. 

Theorem 19 Enforcing GAC on Common is NP-hard. 

Proof: We again use a transformation from 3-SAT. Consider a formula tp with n Boolean 
variables and m clauses. For each Boolean variable i, we introduce a variable Xi G {i, —i}. 
For each clause Cj — x V V z, we introduce Yj S {%, —y, z}. We let N £ {0, .., n} and 
M = m. ip has a model iff the Common constraint has a solution in which the A; take 
the literals true in this model. □ 
One way to propagate a Common constraint is to decompose it into Range and 
Roots constraints: 

Common (N, M, [X x X n ] , [Yi , . . , Y m ] ) iff 
RANGE([y 1 ,..,r m ],{l,..,m},T) A 
Roots ( [A x , .., X n ],S, T) A \S\ = N A 
RANGE([X 1 ,..,X„],{l,..,n},F) A 
Roots ([Fi, .., Y m ], U, V) A \U\ = M 

Enforcing HC on this decomposition is polynomial. Decomposition thus offers a 
simple method to propagate a Common constraint. Not surprisingly, the decomposition 
hinders propagation. 

Theorem 20 GAC on Common is stronger than HC on the decomposition. 

Proof: Consider N = M = 0, X 1 ,Y 1 e {1,2}, X 2 ,Y 2 € {1,3}, Y 3 € {2,3}. Hybrid 
consistency on the decomposition enforces {} C T, V C {1,2,3}, and S = U = {} but no 
pruning on the X t andl}. However, enforcing GAC on Common(A, M, [Ai,A 2 ], [Yi, Y 2 , Y 3 }) 
prunes 2 from X±, 3 from X 2 and 1 from both Y\ and Y 2 . □ 

6.3.3 Symmetric all different 

In certain domains, we may need to find symmetric solutions. For example, in sports 
scheduling problems, if one team is assigned to play another then the second team should 
also be assigned to play the first. SymAllDiff([Ai, ..,X n ]) ensures A", = j iff Xj = i 
P?] . It can be decomposed into a set of Range and Roots constraints: 

SymAllDiff([Ai,..,A„]) iff 
Range([A!, .., A„], {1, .., n}, {1, .., n}) A 
Vi.R00TS([A 1 ,..,A n ],^,{ i }) A Xi€Si A 1^1 = 1 
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It is polynomial to enforce HC on these cases of the Roots constraint. However, as 
with the AllDifferent constraint, it is more effective to use a specialised propagation 
algorithm like that in [27] . 

Theorem 21 GAC on SymAllDiff is stronger than HC on the decomposition. 

Proof: Consider X x e {2,3}, X 2 € {1,3}, X 3 e {1,2}, {} C Si C {2,3}, {} C 5 2 C 
{1,3}, and {} C S 3 C {1,2}. Then the decomposition is HC. However, enforcing GAC 
on SymAllDiff([Xl, X 2 , X 3 ]) will detect unsatisfiability. □ 
To our knowledge, this constraint has not been integrated into any constraint solver. 
Thus, this decomposition provides a means of propagation for the SymAllDiff con- 
straint. 

6.3.4 Uses 

In Section 16.1.41 we decomposed the constraint Uses with Range constraints. Another 
way to propagate a Uses constraint is to decompose it using both Range and Roots 
constraints: 

usEs([x 1) ..,x n ] s [y 1 ,.. s y m ]) iff 

RANGE([X 1 ,..,X„],{l,..,n},T) A 
R00TS([Yi,..,Y m ],{l,..,m},T) 

Enforcing HC on this decomposition is polynomial. Again, such a decomposition 
hinders propagation as achieving GAC on a Uses constraint is NP-Hard. Interestingly, 
the decomposition of Uses using Range constraints presented in Section 16.1.41 and the 
decomposition presented here are equivalent. 

Theorem 22 HC on the decomposition of Uses using only Range constraints is equiv- 
alent to HC on the decomposition using Range and Roots constraints. 

Proof: We just need to show that HC on RootsQYI, .., Y m ], {1, .., to}, T) is equivalent 
to HC on RangeQYi, .., Y m ], {1, .., to}, T') AT' CT. Since, the Range and the Roots 
constraints are over the same set of variables ([Y\, ..,Y m )) and the same set of indices 
({1, .., to}) is fixed for both, then it follows that set variable T" maintained by Range is 
a subset of T maintained by Roots. □ 

6.4 Open constraints 

Open global constraints have recently been introduced. They are a new kind of global 
constraints for which the set of variables involved is not fixed. Range and ROOTS 
constraints are particularly useful to specify many such open global constraints. 

The Gcc constraint has been extended to OpenGcc, a Gcc constraint for which 
the set of variables involved is not known in advance [34] . Given variables X\, .., X n and 
a set variable S, C S C {l..n}, OpenGcc([Xi, ..,X n ],S, [di, .., dm], [Oi, .., O m ]) holds 
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iff \{i £ S I Xi = dj}\ = Oj for all j. OpenGcc can be decomposed into a set of Roots 
constraints in almost the same way as Gcc was decomposed in Section f6. 2. II 



OpenGcc([X 1 , ,.,X n ], S, [di, .., d m ], [Ox, .., O m }) iff 
S= (J Si A 

Vt . RooTs([X u ..,X n ], S h {di}) A 15,1 = 0, 

Propagators for such an open constraint have not yet been included in constraint 
solvers. In [34] . a propagator is proposed for the case where CVs are ground intervals. 
In the decomposition above, the CVs can either be variables or ground intervals. How- 
ever, even when CVs are ground intervals, both the decomposition and the propagator 
presented in [34j hinder propagation and are incomparable to each other. 

Theorem 23 Even if Oi 's are ground intervals, (1) HC on the OpenGcc constraint is 
stronger than HC on the decomposition using ROOTS constraints, (2) the propagator in 
and HC on the decomposition using Roots constraints are incomparable. 

Proof: (1) Consider X U X 2 G {1,2}, X 3 G {1,2,3}, d t = i, S = {1,2,3} and 
O 1} 02,O 3 = [0,1]. The decomposition is HC (with {} C Sx,S 2 C {1,2,3} and {} C 
5*3 C {3}). However, enforcing HC on the OpenGcc constraint will prune 1 and 2 from 
X 3 . 

(2) Consider the example in case (1). The propagator in [33] will prune 1 and 2 from 
X 3 whereas the decomposition is HC. Consider Xi £ {1,2}, X 2 £ {2,3}, X 3 G {3,4}, 
d t = i, {} C S C {1,2,3} and O x = [1,1], 2 = [0,1], 3 = [0,0], 4 = [0,0]. The 
propagator in [34j will prune the only value in the Xj variables which is not HC, that 
is, value 2 for X\. It will not prune the bounds on S. However, enforcing HC on the 
decomposition using Roots constraints will set Si = {1}, then will prune value 2 for 
X\, will shrink S 2 to {} C S2 C {2}, will set S3 = S4 — {} and will finally shrink S to 
{1} CSC {1,2}. □ 

As observed in [33], the definition of OpenGcc subsumes the definition for the open 
version of the AllDifferent constraint. Given variables X\, ..,X n and a set variable 
S, C S C {l..n}, OpenAllDifferent([Ai,..,A„],S') holds iff X 4 ^ Xj,Vi,j G S. 
Interestingly, this constraint can be decomposed using Range in almost the same way 
as AllDifferent was decomposed in Section lB.l.ll 

OpenAllDifferentQXl, .,,X n ],S) iff 
Range([X 1! ..,X„],5,T) A |S*| = |T| 

Not surprisingly, this decomposition hinders propagation (see the example used in 
Theorem [5] to show that the decomposition of AllDifferent using Range hinders 
propagation). Nevertheless, as in the case of OpenGcc, we do not know of any polyno- 
mial algorithm for achieving HC on OpenAllDifferent. 
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6.5 Applications beyond counting and occurrence constraints 



The Range and ROOTS constraints arc useful for specifying a wide range of counting 
and occurrence constraints. Nevertheless, their expressive power permits their use to 
specify many other constraints. 

6.5.1 Element 

The Element constraint introduced in [3T] indexes into an array with a variable. More 
precisely, Element(7, [X\, ..,X n ], J) holds iff Xj = J. For example, we can use such 
a constraint to look up the price of a component included in a configuration problem. 
The Element constraint can be decomposed into a Range constraint without hindering 
propagation: 

Element (7, [Xi, .., X n ], J) iff \S\ = \T\ = l A 
I e S A J eT A RANGE([Xi,..,X n ],S,T) 

Theorem 24 GAC on Element is equivalent to HC on the decomposition. 

Proof: S has all the values in the domain of I in its upper bound. Similarly T has all 
the values in the domain of J in its upper bound. In addition, S and T are forced to 
take a single value. Thus enforcing HC on Range([-X~i, .., X n ], S, T) has the same effect 
as enforcing GAC on Element(J, [Xi, ..,X n ], J). □ 

6.5.2 Global contiguity 

The Contiguity constraint ensures that, in a sequence of 0/1 variables, those taking 
the value 1 appear contiguously. This is a discrete form of convexity. The constraint was 
introduced in [21j to model a hardware configuration problem. It can be decomposed 
into a ROOTS constraint: 

Contiguity ([ X i , -,X n ]) iff 
RooTS([Xi,..,X n ],5,{l}) A 
X = max(S) A Y = mm{S) A \S\ = X - Y + 1 

Again it is polynomial to enforce HC on this case of the ROOTS constraint. Unfortunately, 
decomposition hinders propagation. Whilst Range and Roots can specify concepts 
quite distant from counting and occurrences like convexity, it seems that we may need 
other algorithmic ideas to propagate them effectively. 

Theorem 25 GAC on Contiguity is stronger than HC on the decomposition. 

Proof: Consider X\,X% € {0, 1}, X2,X^ E {1}. Hybrid consistency on the decompo- 
sition will enforce {2,4} CSC {1,2,3,4}, X £ {4}, Y € {1,2} and |5| to be in {3,4} 
but no pruning will happen. However, enforcing GAC on Contiguity([Ai, .., X n ]) will 
prune from X%. □ 



26 



7 Experimental results 



We now experimentally assess the value of using the Range and Roots constraints 
in specifying global counting and occurrence constraints. For these experiments, we 
implemented an algorithm achieving HC on Range and an algorithm achieving HC on 
the decomposition of Roots presented in Section 15.21 Note that our algorithm for the 
decomposition of Roots does not use the Hog Solver primitives member(i>a^ite, set) and 
notmember(uaZue, set) because Hog Solver does not appear to give complete propagation 
on combinations of such primitives (see the discussion in Section l5.5p . We therefore 
implemented our own algorithms from scratch. 

7.1 Pruning power of ROOTS 

In Section 15.21 we proposed a decomposition of the Roots constraint into simple impli- 
cations. The purpose of this subsection is to measure the pruning power of HC on the 
decomposition of Roots with respect to HC on the original ROOTS constraint when we 
do not meet any of the conditions that make HC on the decomposition equivalent to HC 
on the original constraint (see Section [53)1 . We should bear in mind that enforcing HC 
on the ROOTS constraint is NP-hard in general. In order to enforce HC on the ROOTS 
constraint we used a simple table constraint (i.e., a constraint in extension) that has an 
exponential time and space complexity. Consequently, the size of the instances on which 
we were able to run this filtering method was severely constrained. 

An instance is a set of integer variables {Xi, ..X n } and two set variables S and T. 
It can be described by a tuple (n, m,fc, r). The parameter n stands for the number of 
integer variables. These n variables are initialised with the domain {1, ...,m}. The 
upper bound of S is initialised with {1, . . . , n} and the upper bound of T is initialised 
with {1, . . . , m}. The parameter k corresponds to the number of elements of the set 
variable S (resp. set variable T) that are, with equal probability, either put in the lower 
bound or excluded from the upper bound of S (resp. of T). Finally, the parameter r 
is the total number of values removed, with uniform probabilities from the domains of 
the integer variables, keeping at least one value per domain. We generated 1000 random 
instances for each combination of n, m € [4, ..6], k £ [l..min(n, m)] and r G [l..n(m— 1)]. 

For each one of the instances we generated, we propagated RootsQAi, ,.X n ], S, T) 
using either the table constraint (enforcing HC), or our decomposition (enforcing HC in 
special cases). We observed that on 29 out of the 32 combinations of the parameters n, 
m and k, the decomposition achieves HC for all 1000 instances of every value of r. On 
the remaining three classes ((4, 6, 3, *), (5, 6, 3, *) and (6, 6, 3, *)), the decomposition fails 
to detect 0.003% of the inconsistent values. 

As a second experiment, we used the same instances expect that we did not fix or 
remove k values randomly from T, that is, in all instances, lb(T) = and ub(T) = 
{l,...,m}. All other settings remained equal. By doing so, we allowed the random 
domains to reach situations equivalent to that of the counter example given in the proof 
of Theorem 2J With this setting, we observed that the decomposition still achieves HC 
on 18 out of the 32 combinations of the parameters n, m and k, for all 1000 instances 
of every value of r. On the remaining classes, the percentage of inconsistent values not 
pruned by the decomposition increases to 0.039%. 
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Clearly, this experiment is limited in its scope, first by the relatively small size of the 
instances, and second by the choices made for generating random domains. However, we 
conclude that examples of inconsistent values not being detected by the decomposition 
appear to be rare. 

7.2 Pruning power and efficiency of RANGE 

Contrary to the Roots constraint, we have a complete HC propagator for the Range 
constraint. Thus, we do not need to assess the pruning power of our propagator. Never- 
theless, it can be interesting to compare the pruning power and the efficiency of decom- 
posing a global constraint using Range or using another decomposition with simpler 
constraints. 

The purpose of this subsection is to compare the decomposition of Uses using Range 
constraints against a simple decomposition using more elementary constraints. We chose 
the Uses constraint because it is NP-hard to achieve GAC on the Uses constraint (see 
Section I6.1.4|) and there is no propagator available for this constraint in the literature. 
Furthermore, one of the time-tabling problems at the University of Montpellier can easily 
be modelled as a CSP with Uses constraints. We first compare the two decompositions 
of Uses (with or without Range) in terms of run-time as well as pruning power on 
random CSPs. Then, we solve the problem of building the set of courses in the Master 
of Computer Science at the University of Montpellier with the two decompositions. 

7.2.1 Random CSPs 

In order to isolate the effect of the Range constraint from other modelling issues, we 
used the following protocol: we randomly generated instances of binary CSPs and we 
added Uses([Xi, .., X n ], [Yi, .., Y n ]) constraints. In all our experiments, we encode Uses 
in two different ways: 

[ range ] : by decomposing Uses using Range as described in Section 16.1.41 

[decomp] : by decomposing the Uses constraint using primitive constraints as described 
next. 

usEs([Xi,..,x„],[Yi,..,r„]) iff 
ie S ^XieT a j e T -^3ie S.Xi 
ieS' -^YteT' a j e T' -> 3i e S' .Yi 
T cf 

The problem instances are generated according to model B in [53], and can be de- 
scribed with the following parameters: the number of X and Y variables nx and ny in 
Uses constraints, the total number of variables nz, the domain size d, the number of 
binary constraint mi, the number of forbidden tuples t per binary constraint, and the 
number of Uses constraints m^. Note that the Uses constraints can have overlapping 
or disjoint scopes of variables. We distinguish the two cases. All reported results are 
averages on 1000 instances. 



= j A 
= i A 
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Figure 4: Propagating random binary constraint satisfaction problems with three over- 
lapping Uses constraints (class A). 

Our first experiment studies the effectiveness of decomposing Uses with Range for 
propagation alone (not solving). We compared the number of values removed by prop- 
agation on the models obtained by representing Uses constraints in two different ways, 
either using Range (range) or using the simple decomposition (decomp). To simulate 
what happens inside a backtrack search, we repeatedly and randomly choose a variable, 
assign it to one of its values and propagate the set of random binary constraints. After 
doing so for a given number of variables, if the CSP is still consistent, we enforce HC on 
each one of the two decompositions above. Hence, in the experiments, the constraints 
are exposed to a wide range of different variable domains. We report the ratio of values 
removed by propagation on the following classes of problems: 

class A : (nx — 5, rty = 10, nz = 35, d — 20, mi = 70, t = 150, = 3 {overlap)) 
class B : (nx — 5,ny — 10, nz = 45, d = 20, mi = 90, t = 150, ni2 = 3 (disjoint)} 

in which the number of assigned variables varies between 1 and 14. A failure detected 
by the propagation algorithm yields a ratio of 1 (all values are removed). 

We observe in Figures[4]and[5]that propagating the Uses constraint using the Range 
constraint (range model) is much more effective than propagating it using the decompo- 
sition using elementary constraints (decomp model). In certain cases, the range model 
more than doubles the amount of values pruned. For instance after 7 random assignments 
the decomp model prunes only 28.8% of the values for the first problem class (Fig. |4]) 
and 4.4% for the second (Fig. [5]) whilst the Range algorithm respectively prunes 56% 
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^assigned variables 



Figure 5: Propagating random binary constraint satisfaction problems with three disjoint 
Uses constraints (class B). 



and 10.2% of the values. As we see in the next experiments, such a difference in pruning 
can map to considerable savings when solving a problem. 

Our second experiment studies the efficiency of decomposing Uses with Range when 
solving the problems. Our solver used the smallest- domain- first variable ordering heuris- 
tic with the lexicographical value ordering and a cut-off at 600 seconds. We compared 
the cost of solving the two types of models: range and decomp. We report the num- 
ber of fails and the cpu-time needed to find the first solution on the following classes of 
problems: 

class C : (nx — b,ny — I 0, nz — 25, d = I 0, mi = 40, t, mi = 2) 
class D : (nx — 5, ny = 10, nz = 30, d = 1 0, mi = 60, t, wi2 = 2) 

in which t varies between 30 and 80. 

We observe in Figures [5] and [7] that using the decomposition using the elementary 
constraints (decomp model) is not efficient (note the log scale). The instances solved 
here (classes C and D) are much smaller than those used for propagation (classes A 
and B). Solving larger instances was impractical. This second experiment shows that 
Range can reasonably solve problems containing Uses constraints. It also shows the 
clear benefit of using our algorithm in preference to the decomposition using elementary 
constraints over the under-constrained region. As the problems get over-constrained, 
the binary constraints dominate the pruning, and the algorithm has a slight overhead in 
run-time, pruning the same as the decomposition using elementary constraints. 
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Figure 6: Solving random binary constraint satisfaction problems with two overlapping 
Uses constraints (class C). 



7.2.2 Problem of the courses in the master of computer science 

To confirm the results obtained on several types of random instances, we tackle the 
problem of deciding which courses to run in the Master of Computer Science at the 
University of Montpcllicr. This problem, which is usually solved by hand with the help 
of an Excel program, can be specified as follows. The second year of the Master of 
Computer Science advertises a set C of possible courses. There is a set L of n lecturers 
who have skills to teach some subset of the courses (between 1 and 9 per lecturer) . There 
is a set S of m students who bid for which courses they would like to attend (between 
6 and 10 bids per student). A course runs only if at least 5 students bid for it. Every 
lecturer participates in just one course, but several lecturers can be assigned to the same 
course. There is also a set P C L of professors who are in charge of the course in which 
they participate. The goal is to run enough courses so that all lecturers are assigned to 
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Figure 7: Solving random binary constraint satisfaction problems with two disjoint Uses 
constraints (class D). 



one course and all students can attend at least one of the courses for which they bid. 

The models we used have variables Li representing which course is taught by lecturer i 
and variables Sj representing one of the courses student j wants to attend. D(Li) contains 
all courses lecturer i can teach except those that received less than 5 bids. D(Sj) contains 
all courses student j has bid for, except those that received less than 5 bids. We put a con- 
straint USES([ii, . . . , L n ], [Si, . . . , S m ]) an d a constraint AllDifferent^L^ , . . . , L ip ) 
where {Li x , . . . , Li } = P. Model range decomposes Uses with Range, and model 
decomp decomposes Uses with primitive constraints as described in Section [7.2. II 

In the only instance we could obtain from the university, year-2008, there are 50 
lecturers, 26 professors, 53 courses, and 177 students. We solved year-2008, both with 
model decomp and with model range. Both models could find a solution in a few 
milliseconds. 

We modified the two models so that the satisfaction of the students is improved. 
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Instead of trying to satisfy only one of their choices, we try now to satisfy k choices. The 
models are modified in the following way. We create fc copies of each variable Sj, that 
is, Sj,Sj, . . . , Sj, with D(Sj) containing the same values as D(Sj) (see above). We post 
constraints Sj < Sj < . . . < Sj that break symmetries and guarantee that Sj,Sj,...,Sj 
all take different values. Then, instead of having a single Uses constraint, we have k Uses 
constraints, one on each set S\, 5|, . . . , S l m of variables: Uses([£i, . . . , L n ], [Si, . . . , S^J), 
. . ., Uses([Li, ...,L n ], [Si, .. ., S*J)- Model range-fc decomposes Uses with Range, 
and model decomp-fc decomposes Uses with primitive constraints as described in Section 

We solved instance year-2008 with k = 2,3, 4, 5. When k = 2 or k = 3, both models 
find a solution in a few milliseconds, decomp-fc being slightly faster than range-fc. 
range-4 finds a solution in 4 fails and 5.83 sec. whereas decomp-4 was stopped after 24 
hours without finding any solution, range-5 and decomp-5 were stopped after 24 hours 
without finding any solution or proving that none exists. This experiment shows that it 
can be effective to solve a real-world problem containing a global constraint like Uses by 
specifying it with Range instead of using a decomposition with elementary constraints. 

7.3 Solving problems using Range and Roots 

In Section 17.2.21 we showed how decomposing a global constraint with Range can be 
useful to solve a real- world problem. In this subsection we study another real- world prob- 
lem that involves a greater variety of global constraints, some allowing decompositions 
with Range, some others with Roots. More importantly, we will compare monolithic 
propagators of existing well-known global constraints with their decompositions using 
Range and ROOTS. The purpose of this subsection is to see if solving real-world con- 
straint problems using Range and ROOTS leads to acceptable performance compared to 
specialised global constraints and their propagators. 

We used a model for the Mystery Shopper problem pi)] due to Helmut Simonis that 
appears in CSPLib (prob004). We used the same problem instances as in [10] but perform 
a more thorough and extensive analysis. We partition the constraints of this problem 
into three groups: 

Temporal and geographical: All visits for any week are made by different shoppers. 

Similarly, a particular area cannot be visited more than once by the same shopper. 
Shopper: Each shopper makes exactly the required number of visits. 

Saleslady: A saleslady must be visited by some shoppers from at least 2 different groups 
(the shoppers are partitioned into groups). 

The first group of constraints can be modelled by using AllDifferent constraints 
[25] , the second can be modelled by Gcc [5S] and the third by Among constraints [3]. We 
experimented with several models using Hog Solver where these constraints are either im- 
plemented as their Hog Solver primitives (respectively, IloAllDif f , IloDistribute. 
and a decomposition using IloSum on Boolean variables) or as their decompositions with 
Range and Roots. The decomposition of Among([Xi, ..,X n ], [di, ..,d m ],N) we use is 
the one presented in [6], that is, (Bi = 1 Xi € [di, .., d m ]), \/i 6 l--"-A^ Bi = N. Note 
that this decomposition of the Among constraint maintains GAC in theory [6|. This 
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decomposition can be implemented in many solvers using disjunctions of membership con- 
straints: or(notmember(Xi, [d\, .., d m ]), Bi = 1) and or (member (Xi, [di, .., d m \), Bi = 0). 
Unfortunately, Hog Solver does not appear to achieve GAC on such disjunctions of prim- 
itives because the negated membership constraint notmember(Xi, [d\, .., d m \) is activated 
only if Xi is instantiated with a value in [di,..,d m ] whereas it should be as soon as 
D(Xi) C [di,..,^]. 

We report results for the following representative models: 

• Alld-Gcc-Sum uses only Hog Solver primitives; 

• Alld-Gcc-ROOTS where Among is encoded using ROOTS; 

• Alld-ROOTS-Sum where Gcc is encoded using ROOTS; 

• Range- Gcc- Sum where AllDifferent is encoded using RANGE; 

• Al id- Roots- Roots where Among and Gcc are encoded using Roots; 

Note that Among encoded as ROOTS uses the decomposition presented in Section 
16.2.21 the Gcc uses the decomposition presented in Section [5.2.11 and AllDifferent 
uses the decomposition presented in Section f6. 1.1 1 

We study the following important questions: 

• How does the Roots decomposition of the Among constraint compare to the Sum 
decomposition in terms of pruning and run-times? 

• Does the decomposition of Gcc using Roots lead to a reasonable and acceptable 
loss in performance? 

• Does the decomposition of AllDifferent using Range lead to a reasonable and 
acceptable loss in performance? 

• Do we gain in performance by branching on the set variables introduced by the 
ROOTS decomposition? 

To answer the first question, we will compare the model Alld-Gcc-Sum against the 
model Alld-Gcc-ROOTS. To answer the second question, we will compare the model 
Alld-Gcc-Sum against the model Alld-ROOTS-Sum. To answer the third question, we 
will compare the model Alld-Gcc-Sum against the model RANGE-Gcc-Sum. To answer 
the fourth question, we will compare Alld-Gcc-Sum against the model Alld-ROOTS- 
ROOTS that branches on the set variables. 

The instances we use in the experiments are generated as follows. For each number 
of salesladies s € {10, 15,20,25,30,35}, we generate \(s + 2/4) * 4] shoppers, 4 visits. 
Furthermore, to determine the partitioning of the outlets, we bound the number of 
salesladies per outlet between a lower bound and an upper bound and generate all possible 
partitions within these bounds. The number of instances for each class is as follows; for 10 
salesladies we have 10 instances, for 15 salesladies we have 52 instances, for 20 salesladies 
we have 35 instances, for 25 salesladies we have 20 instances, for 30 salesladies we have 
10 instances, and for 35 salesladies we have 56 instances. 

We also tested two variable and value ordering heuristics: 
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Table 1: The sum decomposition of Among in the Mystery Shopper problem versus the 



ROOTS decomposition using lex as a branching strategy. 
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10 
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0.89 
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431.55 
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281.90 
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10.60 
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0.02 
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9.48 


9.48 
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16 
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7.06 


7.06 


16 


0.04 


0.04 


7.00 


7.00 


30 
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0.05 


0.05 


50.00 


50.00 


6 


0.07 


0.07 


49.67 


49.67 


35 


31 


0.23 


0.23 


414.68 


414.68 


31 


0.24 


0.24 


269.32 


269.32 



• We branch on the variables with the minimum domain first and assign values 
lexicographically. We refer to this as dom; 

• We assign a shopper to each saleslady for the first, then for the second week and 
so on. This a static variables and value ordering heuristic. We refer to this as lex. 

However, since lex was consistently better than dom we only report the results using 
lex. 

All instances solved in the experiments use a time limit of 5 minutes. For each class 
of instances we report the number of instances solved (#solved), the average cpu-time 
in seconds over all instances solved by the method (by self), the average cpu-time in 
seconds over all instances solved by both methods (by all) , the average number of failures 
over all instances solved by the method (by self), the average number of failures over 
all instances solved by both methods (by all). 

7.3.1 Among 

When branching on the integer variables using lex (Table [T]) strategy, the Alld-Gcc- 
ROOTS model tends to perform better than the Al ld-Gcc-Sum model in terms of pruning 
(smaller number of fails). Note that the Sum decomposition misses some pruning because 
of the Hog Solver propagators used in this decomposition, as explained at the beginning 
of Section 17.31 This explains the discrepancy. Both models solve the same number of 
instances. The results show that in this case of the Among constraint, our Roots 
decomposition is as efficient as the decomposition using elementary Sum constraints. 
Minor run-time differences are probably due to the cheaper propagator of Hog Solver 
which achieves less pruning. 

7.3.2 Gcc 

The Gcc constraint is one of the most efficient and effective global constraints available 
in most constraint toolkits. The results comparing the Alld-Gcc-Sum model versus its 
equivalent (the Alld-ROOTS-Sum model) where instead of GCC constraints we use our 
decomposition using ROOTS are shown in Table [H We observe that when branching on 
the integer variables using lex, the loss in terms of pruning due to our decomposition 
is very low: the difference in number of fails is less than 5% on the hardest instances. 
This means that our decomposition should scale well when size and difficulty of problems 
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Table 2: The Gcc constraints in the Mystery Shopper problem versus the Roots de- 



composition using lex as a branching strategy. 
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Table 3: The AllDifferent constraints in the Mystery Shopper problem versus the 



Range decomposition using lex as a branching strategy. 
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increases. The difference in run-times is larger (up to more than one order of magni- 
tude). This can be explained in part by the propagation algorithms for Range and 
ROOTS that we have implemented in Hog Solver. They are far from being optimised, as 
opposed to the highly specialised native Gcc propagator. Overall, the loss appears to 
be acceptable. Our results show that, for the Gcc constraint, the decomposition into 
ROOTS leads to adequate performance for prototyping. Nevertheless, providing more 
efficient propagators for Roots is an interesting and open issue. 

7.3.3 Alldifferent 

The AllDifferent constraint is again one of the most efficient and effective global 
constraints available in most constraint toolkits. The results comparing the Alld- 
Gcc-Sum model versus its equivalent (the RANGE-Gcc-Sum model) where instead of 
AllDifferent constraints we use our decomposition using Range are shown in Ta- 
ble [31 We observe that when branching on the integer variables using lex both methods 
achieve the same amount of pruning even if we are not in a case where AllDifferent 
constraints are Permutation constraints (see Section I6.1.1[) . This means that even 
when our decomposition using Range theoretically hinders propagation, it can in prac- 
tice achieve GAC. Concerning run-time efficiency, we observe that both methods solve 
the same number of instances. This is probably a consequence of the good level of pruning 
achieved by the decomposition of AllDifferent using Range. But the Alld-Gcc-Sum 
model is usually faster, up to one order of magnitude in the extreme case. Again, this can 
be explained in part by our basic implementation of the Range and Roots propagators 
in Hog Solver, as opposed to the highly specialised native AllDifferent propagator. 
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Table 4: Branching on set variables in the Mystery Shoppers Problem 
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7.3.4 Exploiting the set variables 

In the previous subsections, we have seen that decomposing global constraints with 
Range and ROOTS constraints is a viable approach. Such decompositions generally 
give very small (if any) loss in terms of pruning and they give acceptable run-time per- 
formance. However, we have seen that our basic decomposition using Roots can be slow 
compared to highly specialised propagators such as those used by Hog Solver for the Gcc 
constraint. In this subsection, we show that, even without optimising our code, we can 
improve the run-time performance of our decomposition just by exploiting its internal 
structure through the extra variables it introduces. 

The decomposition of global constraints using Range and Roots introduces extra 
set variables. We here explore the possibility of branching on the set variables as follows. 
We branch on the set variables first, then on the integer variables with min domain once 
all set variables are instantiated. We refer to this as set. We compare the best model that 
uses the available constraints in Hog Solver (model Alld-Gcc-Sum) versus the best model 
that branches on the set variables (model Alld-ROOTS-ROOTS, in which the Among 
and the Gcc constraints are expressed using the Roots constraint). Surprisingly, we 
solve significantly more instances when branching on the set variables than the model 
Alld-Gcc-Sum. But, again, Alld-Gcc-Sum is a more efficient model when it manages 
to solve the instance. 

These results are primarily due to the better branching strategy. However, such a 
strategy would not be easily implementable without Roots since the extra set variables 
are part of it. We observe here that the extra set variables introduced by the Roots 
decomposition may provide new possibilities for branching strategies that might be ben- 
eficial in practice. 

These results show that by simply changing the branching strategy so that it exploits 
the internal structure of the decompositions, we obtain a significant increase in perfor- 
mance. This gain compensates the loss in cpu-time caused by the preliminary nature of 
our implementation. 

8 Conclusion 

We have proposed two global constraints useful in specifying many counting and oc- 
currence constraints: the Range constraint which computes the range of values used 
by a set of variables, and the ROOTS constraint which computes the variables in a set 
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mapping onto particular values. These two constraints capture the notion of image and 
domain of a function, making them easy to understand to the non expert in constraint 
programming. We have shown that these two constraints can easily specify counting 
and occurrence constraints. For example, the open versions of some well-known global 
constraints can be specified with Range and Roots. Beyond counting and occurrence 
constraints, we have shown that the expressive power of Range and Roots allows them 
to specify many other constraints. 

We have proposed propagation algorithms for these two constraints. Hence, any 
global constraint specified using Range and Roots can be propagated. In some cases, 
this gives a propagation algorithm which achieves GAC on the original global constraint 
(e.g. the Permutation and Among constraints). In other cases, this propagation 
algorithm may not make the original constraint GAC, but achieving GAC is NP-hard 
(e.g. the NValue and Common constraints). Decomposition is then one method to 
obtain a polynomial algorithm. In the remaining cases, the propagation algorithm may 
not make the constraint GAC, although specialised propagation algorithms can do so in 
polynomial time (e.g. the SymAllDiff constraint). Our method can still be attractive 
in this last case as it provides a generic means of propagation for counting and occurrence 
constraints when specialised algorithms have not yet been proposed or are not available 
in the constraint toolkit. 

We have presented a comprehensive study of the Range constraint. We proposed an 
algorithm for enforcing hybrid consistency on Range. We also have presented a com- 
prehensive study of the Roots constraint. We proved that propagating completely the 
ROOTS constraint is intractable in general. We therefore proposed a decomposition to 
propagate it partially. This decomposition achieves hybrid consistency on the Roots 
constraint under some simple conditions often met in practice. In addition, enforcing 
bound consistency on the decomposition achieves bound consistency on the Roots con- 
straint whatever conditions hold. 

Our experiments show the benefit we can obtain by incorporating the Range and the 
ROOTS constraints in a constraint toolkit. First, despite being intractable, the Roots 
constraint can be propagated using the decomposition we presented. Even if this de- 
composition hinders propagation in theory, our experiments show that it is seldom the 
case in practice. Second, in the absence of specialised propagation algorithms, Range 
and Roots appear to be a simple and a reasonable method for propagating (possibly 
intractable) global constraints that is competitive to other decompositions into more el- 
ementary constraints. Our experiments show that sometimes we do better than these 
other decompositions either in terms of pruning or in solution time or both (like the case 
of the decomposition of the Uses constraint). In addition, compared to highly specialised 
propagation algorithms like those for the AllDifferent and Gcc constraints in Hog 
Solver, the loss in performance when using Range and ROOTS was not great. Thus, 
if the constraint toolkit lacks a specialised propagation algorithm, Range and Roots 
offer a quick, easy, and acceptable way of propagation. Finally, we observed that the 
extra set variables introduced in Range and Roots decompositions can be exploited 
in the design of new branching strategics. These extra set variables may provide both a 
modelling and solving advantage to the user. We hope that by presenting these results, 
developers of the many different constraint toolkits will be encouraged to include the 
Range and Roots constraints into their solvers. 
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